Picture for Hongfei Xue

Hongfei Xue

WenetSpeech-Wu: Datasets, Benchmarks, and Models for a Unified Chinese Wu Dialect Speech Processing Ecosystem

Add code
Jan 16, 2026
Viaarxiv icon

Crystal Generation using the Fully Differentiable Pipeline and Latent Space Optimization

Add code
Jan 08, 2026
Viaarxiv icon

Lifelong Domain Adaptive 3D Human Pose Estimation

Add code
Dec 29, 2025
Viaarxiv icon

The TEA-ASLP System for Multilingual Conversational Speech Recognition and Speech Diarization in MLC-SLM 2025 Challenge

Add code
Jul 24, 2025
Viaarxiv icon

AI-Assisted Rapid Crystal Structure Generation Towards a Target Local Environment

Add code
Jun 09, 2025
Figure 1 for AI-Assisted Rapid Crystal Structure Generation Towards a Target Local Environment
Figure 2 for AI-Assisted Rapid Crystal Structure Generation Towards a Target Local Environment
Figure 3 for AI-Assisted Rapid Crystal Structure Generation Towards a Target Local Environment
Figure 4 for AI-Assisted Rapid Crystal Structure Generation Towards a Target Local Environment
Viaarxiv icon

Delayed-KD: Delayed Knowledge Distillation based CTC for Low-Latency Streaming ASR

Add code
May 28, 2025
Figure 1 for Delayed-KD: Delayed Knowledge Distillation based CTC for Low-Latency Streaming ASR
Figure 2 for Delayed-KD: Delayed Knowledge Distillation based CTC for Low-Latency Streaming ASR
Figure 3 for Delayed-KD: Delayed Knowledge Distillation based CTC for Low-Latency Streaming ASR
Figure 4 for Delayed-KD: Delayed Knowledge Distillation based CTC for Low-Latency Streaming ASR
Viaarxiv icon

Selective Invocation for Multilingual ASR: A Cost-effective Approach Adapting to Speech Recognition Difficulty

Add code
May 22, 2025
Viaarxiv icon

Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning

Add code
Apr 29, 2025
Figure 1 for Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning
Figure 2 for Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning
Figure 3 for Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning
Figure 4 for Enhancing Non-Core Language Instruction-Following in Speech LLMs via Semi-Implicit Cross-Lingual CoT Reasoning
Viaarxiv icon

DanceMosaic: High-Fidelity Dance Generation with Multimodal Editability

Add code
Apr 06, 2025
Viaarxiv icon

HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models

Add code
Mar 24, 2025
Figure 1 for HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models
Figure 2 for HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models
Figure 3 for HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models
Figure 4 for HOIGPT: Learning Long Sequence Hand-Object Interaction with Language Models
Viaarxiv icon